26 research outputs found

    The Blogosphere at a Glance — Content-Based Structures Made Simple

    Get PDF
    A network representation based on a basic wordoverlap similarity measure between blogs is introduced. The simplicity of the representation renders it computationally tractable, transparent and insensitive to representation-dependent artifacts. Using Swedish blog data, we demonstrate that the representation, in spite of its simplicity, manages to capture important structural properties of the content in the blogosphere. First, blogs that treat similar subjects are organized in distinct network clusters. Second, the network is hierarchically organized as clusters in turn form higher-order clusters: a compound structure reminiscent of a blog taxonomy

    A method for inferring hierarchical dynamics in stochastic processes

    Full text link
    Complex systems may often be characterized by their hierarchical dynamics. In this paper do we present a method and an operational algorithm that automatically infer this property in a broad range of systems; discrete stochastic processes. The main idea is to systematically explore the set of projections from the state space of a process to smaller state spaces, and to determine which of the projections that impose Markovian dynamics on the coarser level. These projections, which we call Markov projections, then constitute the hierarchical dynamics of the system. The algorithm operates on time series or other statistics, so a priori knowledge of the intrinsic workings of a system is not required in order to determine its hierarchical dynamics. We illustrate the method by applying it to two simple processes; a finite state automaton and an iterated map.Comment: 16 pages, 12 figure

    Autonomous Accident Monitoring Using Cellular Network Data

    Get PDF
    Mobile communication networks constitute large-scale sensor networks that generate huge amounts of data that can be refined into collective mobility patterns. In this paper we propose a method for using these patterns to autonomously monitor and detect accidents and other critical events. The approach is to identify a measure that is approximately time-invariant on short time-scales under regular conditions, estimate the short and long-term dynamics of this measure using Bayesian inference, and identify sudden shifts in mobility patterns by monitoring the divergence between the short and long-term estimates. By estimating long-term dynamics, the method is also able to adapt to long-term trends in data. As a proof-of-concept, we apply this approach in a vehicular traffic scenario, where we demonstrate that the method can detect traffic accidents and distinguish these from regular events, such as traffic congestions

    Quasi-Species and Aggregate Dynamics

    Full text link
    At an early stage in pre-biotic evolution, groups of replicating molecules must coordinate their reproduction to form aggregated units of selection. Mechanisms that enable this to occur are currently not well understood. In this paper we introduce a deterministic model of primitive replicating aggregates, proto-organisms, that host populations of replicating information carrying molecules. Some of the molecules promote the reproduction of the proto-organism at the cost of their individual replication rate. A situation resembling that of group selection arises. We derive and analytically solve a partial differential equation that describes the system. We find that the relative prevalence of fast and slow replicators is determined by the relative strength of selection at the aggregate level to the selection strength at the molecular level. The analysis is concluded by a preliminary treatment of finite population size effects.Comment: 6 page

    A model-independent approach to infer hierarchical codon substitution dynamics

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Codon substitution constitutes a fundamental process in molecular biology that has been studied extensively. However, prior studies rely on various assumptions, e.g. regarding the relevance of specific biochemical properties, or on conservation criteria for defining substitution groups. Ideally, one would instead like to analyze the substitution process in terms of raw dynamics, independently of underlying system specifics. In this paper we propose a method for doing this by identifying groups of codons and amino acids such that these groups imply closed dynamics. The approach relies on recently developed spectral and agglomerative techniques for identifying hierarchical organization in dynamical systems.</p> <p>Results</p> <p>We have applied the techniques on an empirically derived Markov model of the codon substitution process that is provided in the literature. Without system specific knowledge of the substitution process, the techniques manage to "blindly" identify multiple levels of dynamics; from amino acid substitutions (via the standard genetic code) to higher order dynamics on the level of amino acid groups. We hypothesize that the acquired groups reflect earlier versions of the genetic code.</p> <p>Conclusions</p> <p>The results demonstrate the applicability of the techniques. Due to their generality, we believe that they can be used to coarse grain and identify hierarchical organization in a broad range of other biological systems and processes, such as protein interaction networks, genetic regulatory networks and food webs.</p

    Cross-lingual comparison between distributionally determined word similarity networks

    Get PDF
    As an initial effort to identify universal and language-specific factors that influence the behavior of distributional models, we have formulated a distributionally determined word similarity network model, implemented it for eleven different languages, and compared the resulting networks. In the model, vertices constitute words and two words are linked if they occur in similar contexts. The model is found to capture clear isomorphisms across languages in terms of syntactic and semantic classes, as well as functional categories of abstract discourse markers. Language specific morphology is found to be a dominating factor for the accuracy of the model

    Hierarchical self-organization in the finitary process soup

    No full text
    Current analyses of genomes from numerous species show that the diversity of organism’s functional and behavioral characters is not proportional to the number of genes that encode the organism. We investigate the hypothesis that the diversity of organismal character is due to hierarchical organization. We do this with the recently introduced model of the finitary process soup, which allows for a detailed mathematical and quantitative analysis of the population dynamics of structural complexity. Here we show that global complexity in the finitary process soup is due to the emergence of successively higher levels of organization, that the hierarchical structure appears spontaneously, and that the process of structural innovation is facilitated by the discovery and maintenance of relatively noncomplex, but general individuals in a population
    corecore